A higher order estimate of the optimum checkpoint interval for restart dumps
نویسندگان
چکیده
منابع مشابه
A higher order estimate of the optimum checkpoint interval for restart dumps
This paper examines methods of approximating the optimum checkpoint restart strategy for minimizing application run time on a system exhibiting Poisson single component failures. Two different models will be developed and compared. We will begin with a simplified cost function that yields a first-order model. Then we will derive a more complete cost function and demonstrate a perturbation solut...
متن کاملA Model for Predicting the Optimum Checkpoint Interval for Restart Dumps
As the run time of an application approaches the the mean time to interrupt (MTTI) for the system on which it is running, it becomes necessary to generate intermediate snapshots of the application’s run state, known as checkpoint files or restart dumps. In the event of a system failure that halts program execution, these snapshots allow an application to resume computing from the most recently ...
متن کاملthe innovation of a statistical model to estimate dependable rainfall (dr) and develop it for determination and classification of drought and wet years of iran
آب حاصل از بارش منبع تأمین نیازهای بی شمار جانداران به ویژه انسان است و هرگونه کاهش در کم و کیف آن مستقیماً حیات موجودات زنده را تحت تأثیر منفی قرار می دهد. نوسان سال به سال بارش از ویژگی های اساسی و بسیار مهم بارش های سالانه ایران محسوب می شود که آثار زیان بار آن در تمام عرصه های اقتصادی، اجتماعی و حتی سیاسی- امنیتی به نحوی منعکس می شود. چون میزان آب ناشی از بارش یکی از مولفه های اصلی برنامه ...
15 صفحه اولAvida Checkpoint/Restart Implementation
As high performance computing centers (HPCC) continue to grow in popularity, issues of resource management and reliability are becoming limiting factors in scheduling long-running jobs. One of the factors is time. Efficacy of Avida, the platform of research for digital evolution, is limited by the allocated time available on HPCC. Saving the state of the software at a point and restarting the e...
متن کاملA Survey of Checkpoint/Restart Implementations
In this paper we evaluate candidates for a checkpoint/restart implementation against a common set of requirements. Overall characteristics of the two main classes of checkpoint systems, library and system, are discussed followed by specific examples from existing systems. A detailed description of two system implementations is presented. We conclude that no single publically available implement...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Future Generation Computer Systems
سال: 2006
ISSN: 0167-739X
DOI: 10.1016/j.future.2004.11.016